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ABSTRACT: Digital hydrologic networks depicting surface-water pathways and their associated drainage catch- 
ments provide a key component to hydrologic analysis and modeling. Collectively, they form common spatial 
units that can be used to frame the descriptions of aquatic and watershed processes. In addition, they provide 
the ability to simulate and route the movement of water and associated constituents throughout the landscape. 
Digital hydrologic networks have evolved from derivatives of mapping products to detailed, interconnected, spa- 
tially referenced networks of water pathways, drainage areas, and stream and watershed characteristics. These 
properties are important because they enhance the ability to spatially evaluate factors that affect the sources 
and transport of water-quality constituents at various scales. SPAtially Referenced Regressions On Watershed 
attributes (SPARROW), a process-based/statistical model, relies on a digital hydrologic network in order to 
establish relations between quantities of monitored contaminant flux, contaminant sources, and the associated 
physical characteristics affecting contaminant transport. Digital hydrologic networks modified from the River 
Reach File (RF1) and National Hydrography Dataset (NHD) geospatial datasets provided frameworks for SPAR- 
ROW in six regions of the conterminous United States. In addition, characteristics of the modified RF1 were 
used to update estimates of mean-annual streamflow. This produced more current flow estimates for use in 
SPARROW modeling. 
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INTRODUCTION 



Processes controlling the supply, fate, and 
transport of chemical and organic constituents in 
terrestrial and aquatic systems occur throughout 
a watershed, from the headwater areas to the 



downstream receiving waters (Howarth et al., 1996; 
Seitzinger et al., 2002; Van Breemen et al., 2002; 
McClain etal, 2003; Alexander et al, 2007). Spa- 
tially distributed physical characteristics are often 
used to describe such processes and conditions. These 
descriptions help formulate a better understanding of 
the intrinsic connections between land, water, and 
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their interactions (Alexander et al., 2007). A funda- 
mental task prior to any quantitative approach to 
analyzing and validating these associations is defin- 
ing logically connected spatial units that frame aqua- 
tic and watershed characterizations. Once developed, 
a consistent framework can be used to capture, store, 
and analyze relations and watershed characteriza- 
tions within a geographically referenced network. 
Additionally, this information can be applied to 
hydrologic modeling applications designed to identify 
and investigate the spatial and temporal relations 
between the constituents and the processes that 
affect transport. Evaluating results of these modeling 
applications can contribute to the understanding of 
conditions and the management of activities related 
to the processes within the system (Kondolf et al. , 
2002; Driscoll et al., 2003; Grimm et al., 2003; 
Alexander et al., 2007; Gong et al, 2009). 

A digital hydrologic network of connected surface- 
water pathways and the areas they drain can be used 
as a foundation for a consistent spatial framework to 
characterize and analyze watershed processes. Dis- 
crete spatial units can help delineate, visualize, and 
spatially reference physical properties of a watershed 
system. These properties include landscape, aquatic 
and subsurface watershed characteristics such as 
contaminant supply, slope, soil characteristics, and 
annual streamflow. Hydrologic connectivity informa- 
tion from a digital network is equally important. The 
structure can facilitate the ability to simulate the 
movement of water and associated constituents 
within the system. This permits spatial analysis 
upstream and downstream relative to any location 
along a surface-water pathway. Additionally, infor- 
mation depicting the geographic connectivity provides 
the ability to establish and assess any spatial and 
temporal relations that may exist between the inter- 
actions of the associated watershed characteristics, 
the flow of water over the landscape, and within 
aquatic systems. 

In this article, we provide a historical perspective 
of digital hydrologic networks and the origins of geo- 
spatial data that support the development and evolu- 
tion of such networks. We also describe the important 
roles networks play in providing a spatial infrastruc- 
ture for supporting hydrologic-transport models such 
as SPAtially Referenced Regressions On Watershed 
attributes (SPARROW). Modified versions of the 
River Reach File (RF1) (Horn et al., 1994; USEPA, 
1996) and the National Hydrography Dataset (NHD) 
(USGS, 1999b) are emphasized and discussed in the 
context of SPARROW model applications. These mod- 
els have been developed in six regions of the conter- 
minous United States and are presented in this issue. 
In addition, we describe an application that utilizes 
topological properties of a digital hydrologic network 



to estimate mean-annual streamflow in unmonitored 
stream reaches. This application produced more 
current flow estimates specified in the regional 
SPARROW models. 

Using geographic information systems (GIS), spa- 
tially referencing various natural and human-related 
watershed characteristics to a digital network allows 
for the rapid display and analysis of the geographic 
distributions. Relative quantities and the factors 
related to supply, fate, and constituent transport 
can be evaluated in geographic detail. Spatially 
referenced point locations along the network where 
direct measurements have been collected over time 
(instream monitoring) provide the means to discern 
multiple relationships between watershed character- 
istics, processes, and the observations. These relation- 
ships can then be evaluated in both space and in 
time. Watershed characteristics and process descrip- 
tions and associations can also be utilized in a variety 
of applications to further determine and assess the 
connections and the processes controlling fate and 
transport. These applications include flux-based 
water-quality transport models and methods to esti- 
mate stream characteristics like flow and velocity in 
ungaged locations. 

Many hydrologic models describe some aspect of the 
physical properties of the landscape. This includes the 
movement of mass in space and/or the change of mass 
in time. Models also can be designed to establish rela- 
tions between water-quality monitoring, the supply of 
contaminants, and the natural attenuation processes 
that occur in transport across the landscape and 
within water pathways. Interpretations of model 
results can be useful at broad spatial and temporal 
scales to help address a variety of environmental- 
management decisions, including the design and track- 
ing of contaminant reduction and protection strategies, 
monitoring practices and priorities, stream-health 
assessments, and regulatory requirements such as 
total maximum daily loads (TMDLs) (Schwarz etal., 
2006; Preston etal., 2009). SPARROW is one such 
model that integrates monitoring and modeling. The 
watershed-modeling approach uses nonlinear statisti- 
cal methods to define conceptual and spatial relations 
among quantities of contaminant sources, monitored 
contaminant flux, aquatic transport processes, and the 
physical characteristics that potentially affect contam- 
inant transport to and within streams (Smith et al. , 
1997; Preston et al., 2009). A digital representation of 
a hydrologic network provides the fundamental frame- 
work for the spatial infrastructure supporting SPAR- 
ROW models (Schwarz et al., 2006). A linear network 
of stream reaches and associated drainage areas collec- 
tively form this basic foundation for spatially referenc- 
ing monitored and predicted stream flux, quantities 
of potential contaminant sources, and stream and 
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watershed characteristics to individual stream reaches 
and drainage areas (Schwarz et al., 2006). This infra- 
structure also allows for a comprehensive, quantitative 
assessment of landscape characterizations and the 
relationships to water-quality conditions and the pro- 
cesses controlling supply and transport of constituents 
over a broad spatial domain rather than just at point 
locations where monitoring data are collected. 



ORIGINS OF DIGITAL HYDROLOGIC NETWORKS 



Many digital hydrologic networks used within the 
United States at regional scales have been derived 
from information collected by national mapping pro- 
grams. The conterminous United States was mapped 
by topographic quadrangle at increasingly larger 
scales (i.e., finer detail) over a 100-year period until 
the early 1990s, when l:24,000-scale mapping was 
completed (Kelmelis et al., 2003). Mapping included, 
among other topics, the geographic locations of hydro- 
logic features and interpreted elevation contours. The 
computer age facilitated a transition from maintain- 
ing and revising paper topographic maps to more effi- 
cient automated digital map-production procedures 
(Thompson, 1988). The National Digital Cartographic 
Data Base (NDCDB) was established by the U.S. 
Geological Survey (USGS) to distribute digital 
map data that adhere to map-production standards 
(McEwen and Jacknow, 1980). Digital line graph 
(DLG) files were one of the first national representa- 
tions of cartographic hydrologic features in digital 
form available to the general public (USGS, 1989). 
DLG files are digital representations of selected car- 
tographic data typically displayed on published USGS 
topographic quadrangle and sectional maps. The data 
structure of the DLGs maintained spatial relations of 
hydrologic features such as connectivity and adja- 
cency between linear and areal features that allowed 
for simple plotting or analysis of their spatial rela- 
tions (USGS, 1989). The files originated from small 
(1:2,000,000) to intermediate (1:100,000) scales, to 
large (1:24,000) scale topographic map series gener- 
ated and revised over the last century. The NDCDB 
also contained digital elevation models (DEMs) 
derived from matrices of elevation points spaced at 
regular distances. Like the mapping programs, eleva- 
tion points were compiled using various methods that 
have progressed over time and been produced at 
several spatial resolutions (USGS, 1987). 

Linking geographic information to digital represen- 
tations of hydrologic features is a logical progression 
from maps to GIS and subsequent frameworks sup- 
porting hydrologic modeling (Maidment, 1993; Brilly 



et al., 1993). Early DLG files representing hydrology 
included an attempt to describe physical features of 
the mapped hydrography using a series of numerical 
codes. These codes were successful in identifying the 
cartographic features (like intermittent and perennial 
streams or rivers) once visualized on a paper map, 
but were limited in the ability to describe additional 
characteristics of the mapped feature. In addition, 
maintaining the hydrologic direction of flow of 
streams and rivers was subjectively determined at 
the time of data capture (USGS, 1989). Cartographic 
interpretations subject to human judgment in topo- 
graphic map production (Thompson, 1988) and the 
physical condition (wetness of a particular year a sur- 
vey was made) contributed to varying stream densi- 
ties from quadrangle map to quadrangle map 
(Langbein, 1947). This inconsistency transferred to 
the digital products, thus challenging consistent 
regional assessments of hydrology. 

In the early 1970s, the U.S. Environmental Protec- 
tion Agency (USEPA) developed a hydrographic data- 
base of surface waters (RF1) for the conterminous 
United States (Horn et al, 1994; USEPA, 1996). 
Designed to establish hydrologic ordering for naviga- 
tion and modeling purposes, the river reach files pro- 
gressed through numerous versions in an attempt to 
provide a common spatial framework that could be 
used to integrate numerous environmentally signifi- 
cant surface-water databases at reasonable spatial 
scales (Horn et al. , 1994; USEPA, 1996). The initial 
spatial framework consisted of digitized hydrologic 
features from National Oceanographic and Atmo- 
spheric Administration (NOAA) aeronautical charts at 
a 1:500,000 map scale (photo reduced from 1:250,000) 
and associated attributes contained in a single table. 
Representing the most detailed national hydrologic 
digital network of the time, RF1 introduced the con- 
cept of digital reach indexing for more than 60,000 
streams nationally. Indexing allowed for a consistent 
framework that could take advantage of spatially ref- 
erenced hydrologic information using a unique coding 
system and topological ordering of each stream reach 
for navigation and transport applications. Early appli- 
cations included streamflow and velocity estimations, 
water-quality modeling, relocating point-source loca- 
tions, and water-quality monitoring associations in 
support of the Clean Water Act (Horn et al., 1994). 
However, some areas of the country exhibit unnatural 
differences in stream densities and spatial detail. In 
some instances, reaches were inappropriately con- 
nected at various locations, which could contribute to 
unexpected results. The physical locations of some 
surface-water pathways in RF1 are also suspect based 
on a comparison with landscape features such as ele- 
vation. Additional associated physical characteristics 
such as reach slope may be difficult to accurately 
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calculate because of these discrepancies in location. 
In recent times, RF1 has been modified to address 
some of these deficiencies (see Supporting Information 
for details). In addition, catchments representing 
drainage areas for each mapped reach have been gen- 
erated from various resolutions ranging from 30 m to 
1 km cell sizes (Nolan et al., 2002; Brakebill and Pres- 
ton, 2003). Modified versions of RF1 and associated 
catchments have supported specific regional and 
national transport-modeling applications describing 
watershed conditions and are still widely used today 
(Horn et al., 1994; Smith et al., 1997; Alexander et al., 
1999; Preston and Brakebill, 1999; Nolan et al., 2002; 
McMahon et al., 2003; USEPA, 2007; Alexander et al., 
2008; Brakebill et al., 2010). 

The success of RF1 prompted the development of a 
more comprehensive hydrologic database. Reach File 
Version 3.0 (RF3) addressed the desire for more 
detailed hydrologic features at finer scales. Geo- 
graphic representations of hydrologic features in RF3 
were based on USGS 1:100,000 DLGs and provided a 
more detailed database for national, regional, and 
local reporting requirements such as those found 
in 305(b) sections of the Clean Water Act. Over 
3,000,000 naturally flowing streams and constructed 
water bodies were represented nationally in RF3, a 
considerable increase from RF1 (Horn et al., 1994). 

RF3 and DLGs have since been incorporated into a 
NHD (Simley and Carswell, 2009). NHD maintains 
the ability to map hydrologic features in addition to 
an improved address system for spatial referencing 
and topological networking for stream navigation. 
NHD is currently being produced and distributed at 
medium (l:100,000-scale) and high (l:24,000-scale) 
resolutions (Simley and Carswell, 2009). NHD also 
supports systematic exchange, updates, and improve- 
ments to the data (USGS, 1999b). Specific protocols 
exist for updating features and related information. 
Stewards of the datasets are coordinated through 
partnerships among multiple federal, state, and local 
agencies. This coordination can potentially minimize 
the duplication of effort among stewards and provide 
a consistent, useful dataset for modeling and other 
water-resource applications. 

A multi-agency effort has incorporated and 
expanded the capabilities of the medium-resolution 
NHD. NHDPlus (USEPA and USGS, 2005, 2009) is 
an application-ready product based on a 2005 snap- 
shot of the medium-resolution NHD, the National 
Elevation Dataset (NED) (USGS, 1999a), and the 
National Watershed Boundary Dataset (WBD) 
(Simley and Carswell, 2009). NHDPlus can be used 
to improve the ability to study cause-and-effect rela- 
tions in hydrologic processes and water quality at 
finer spatial scales (Alexander etal, 2007; USEPA 
and USGS, 2005; Moore etal, 2004). Regionally, 



NHDPlus provides significant spatial detail and a 
realistic representation of hydrologic pathways and 
numerous spatially referenced characteristics (attri- 
butes). Expanded capabilities of NHDPlus include 
updated reach-network connections and topology, 
30 m elevation-derived catchments and flow paths 
(Johnston etal., 2009), estimates of streamflow and 
velocity, and value-added attributes of spatially refer- 
enced landscape characteristics like land use and 
climate (USEPA and USGS, 2009). NHDPlus also 
integrates hydrologic features with other available 
data sources, including USGS streamgaging stations, 
the NED, and the WBD. The framework and subse- 
quent tools developed for NHDPlus (and in develop- 
ment for high-resolution NHD) provide the ability to 
customize the behavior of the stream network, in 
addition to building and including user-defined attri- 
butes (USEPA and USGS, 2005, 2009). Because the 
locations of surface-water features within NHDPlus 
are based on cartographic interpretations, mapping 
anomalies of surface-water features also exist in the 
dataset. These include isolated (unconnected) stream 
reaches. Some isolated reaches are real features, but 
many are an artifact of the quadrangle map-produc- 
tion process, where in some cases, streams stop at 
the edge of quadrangle boundaries. These occurrences 
are currently being identified and corrected. 

Associated NHDPlus attributes are stored in many 
external tables related to the surface-water features by 
common fields. The one-to-one and one-to-many rela- 
tionships between geographic locations of surface- 
water features and other associated attributes can be 
complex compared to the simplicity of a dataset like 
RF1, in which all associated attribute information is 
stored in one table. This complexity is due in part to 
the fact that NHDPlus was designed to meet many 
purposes and utilizes several external data sources. In 
addition, NHDPlus contains a vast number of records. 
The ability to identify and connect to the proper table 
in which the information of interest is stored can be 
somewhat challenging (USEPA and USGS, 2009). 

Scale is important when determining stream net- 
works for specific uses because scale will affect the 
number of streams defined (Alexander etal., 2009). 
The number of streams, in turn, can affect the ability 
to determine the true catchment composition, stream 
length, and stream density. These are all useful 
quantitative measures of a stream network and the 
hydrologic responses of drainage-basin transport 
capacity and instream decay rates for contaminants 
in various stream sizes (Horton, 1945; Schwarz et al., 
2006). In addition, the availability and limitations of 
digital spatial data used to define, refine, and charac- 
terize hydrologic networks can vary in geographic 
extent, content, and scale, thus contributing to the 
complexity of regional assessments (Fargas et al. , 
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FIGURE 1. Maps Showing the Variations in Spatial Detail Within 
and Between (a) RF1 and (b) Medium Resolution NHDPlus 
in the New England Area of the Eastern United States. 



1997; Fekete etal., 2001). Figure 1 demonstrates 
varying mapped stream densities between RF1 and 
medium-resolution NHDPlus in the New England 
area of the United States. When viewed from a regio- 
nal perspective, inconsistencies between mapped 
quadrangles are evident in the NHD geospatial 
dataset. Inconsistent stream densities within the 
region are also evident in the RF1 dataset. These dif- 
ferences in stream densities are artifacts of scale and 
map-production techniques. 

Continental- and global-scale elevation data, along 
with a variety of spatial analysis methods, have con- 
tributed to the ability to define and enhance hydro- 
logic networks at various scales and geographic 
extents (Montgomery and Foufoula-Georgiou, 1993; 
USGS, 1997; Fekete etal., 2001; Elliott etal., 2005; 
Verstraeten, 2006; Davies and Bell, 2009). Basic 
applications of generating hydrologic networks from 
DEMs rely on the direction of surface-water flow from 
each elevation cell (flow direction) and the accumula- 
tion of cells flowing into any given cell (flow accumu- 
lation). Flow direction represents the steepest 
downslope direction that water on the land surface 
would flow. Once this direction is known, the number 
of cells flowing into any given cell can be calculated. 
In addition, surface-water flow pathways and associ- 
ated catchment and watershed boundaries can be 
delineated (Jenson, 1991). One distinct advantage of 
DEM-based hydrographic analysis is that elevation- 
derived stream networks are not subject to poten- 
tially inconsistent cartographic interpretations. In 
addition, automated techniques used to generate 
stream networks and associated catchments can also 
be applied for repetitive procedures and outcomes. 
Such networks also are not limited by a fixed map 
scale like 1:100,000 or 1:500,000. However, they can 

JAWRA 



be affected by inherent errors in the DEM creation 
process, causing potential misrepresentations and 
interpretations of the landscape, especially in areas 
of low relief. Networks also can be limited by the cell 
size (resolution) of the elevation data. Depending on 
the cell resolution of the DEM, hydrologic features 
can become oversimplified, contributing to a loss in 
accuracy or spatial detail. Ambiguities among the 
choices of search algorithms selected to create the 
network also can contribute to limitations and varia- 
tions in interpretation. Consequently, stream net- 
works and associated catchments derived from 
elevation data only may be inconsistent with previ- 
ously accepted vector representations of hydrologic 
features (Saunders, 2000). Elevation-value adjust- 
ment is a potential remedy that addresses the mis- 
alignment of vector and DEM-derived networks to 
better align the hydrologic features with the topogra- 
phy of the landscape. This is the case when develop- 
ing catchments for enhanced versions of the RF1 (see 
Supplementary Information for details) and the 
NHDPlus (Johnston etal, 2009). The accuracy of 
the results can vary depending on the scale of the 
vector representations of surface-water features, cell 
resolution of the elevation data, and natural topo- 
graphic relief of the area (Saunders, 2000; Wilson 
and Gallant, 2000). Elevation points collected at very 
close intervals, such as light detection and range 
(LIDAR) data, may be more representative of the 
topographic relief. These data can be used to produce 
detailed stream networks at very large scales. How- 
ever, regional applications may be impractical using 
this type of data because of inconsistent data collec- 
tion over large areas. In addition, extremely large 
computer files can be generated for even a small area, 
creating potential data-management issues and prob- 
lematic results from network-generation algorithms. 



DIGITAL HYDROLOGIC NETWORKS 
SUPPORTING SPARROW 



A digital representation of a hydrologic network is 
the fundamental framework of the spatial infrastruc- 
ture supporting SPARROW models (Schwarz etal., 
2006). Collectively, the network of stream reaches 
and associated catchments form common spatial units 
used to frame aquatic and watershed characteriza- 
tions. Associated reach characterizations and trans- 
port properties are then used by SPARROW to 
provide detailed spatial evaluations of the factors 
and processes affecting the source and transport of 
contaminants throughout the river network and its 
drainage area (Schwarz et al., 2006). 
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The SPARROW model structure, in conjunction 
with spatially distributed characteristics within the 
hydrologic network, provides a statistical basis for 
empirically estimating stream-contaminant flux (pre- 
dictions) in unmonitored areas. The SPARROW meth- 
odology allows for separate statistical estimates of 
spatially referenced explanatory watershed character- 
istics that quantify the amounts of contaminant 
sources individually or collectively. These estimates 
are weighted by established relations between con- 
stituent mass and other geographically referenced 
physical factors affecting aquatic and terrestrial con- 
taminant supply, fate, and transport. Because of spa- 
tial referencing and network connectivity, estimates 
of contaminant flux can be quantified at any location 
along the network. These quantities of contaminant 
flux for each reach can be portrayed in specialized 
maps at multiple spatial scales to better interpret 
and visualize the contributions from individual 
contaminant sources. 

A stream reach is the basic building block of 
the modeling framework (Figure 2). It is defined by a 
single vector representing surface-water pathways. 
Each reach extends either from headwater to stream 
junction, or from one stream junction to another 
stream junction (Brakebill and Preston, 2003; 
Schwarz et al, 2006). Each reach is consistently 
oriented in the direction of streamflow and is usually 
connected to at least one other reach at its down- 
stream node. Nodes are endpoints of lines that 
maintain the identity, direction, and location of 
intersected linear features. They are defined by a 




FIGURE 2. Schematic Drawing Showing the Spatial and Topologi- 
cal Relations Between a Reach, the Network Topology, a Monitor- 
ing Site, and Associated Catchments. Each reach is labeled with a 
unique identifier (MRB_ID) and a hydrologic sequence number 
(HYDSEQ). In addition, the terminal ends of each reach are labeled 
with node identifier (NODE-ID) values. The node at the down- 
stream end of a reach is identified as the to-node (TNODE) and the 
node at the upstream end of a reach is identified as the from-node 
(FNODE). 



numbering system used to relate the upstream 
(FNODE) and downstream (TNODE) ends of con- 
nected stream segments (ESRI, 1992). This topological 
information is used to define reach-to-reach connectiv- 
ity where the upstream node of a reach has the same 
identification number as the downstream node of the 
reach just upstream. Instances of a reach not connect- 
ing to another reach at the downstream node include: 
reaches that are part of a naturally closed basin; iso- 
lated reaches where the surface-water pathway is not 
clear and therefore not mapped; or a reach that is 
determined to be terminal, representing a subjective 
end to the stream (surface-water) transport. 

Each reach also must be assigned a unique numer- 
ical sequence value (HYDSEQ) indicating its hydro- 
logic order, from headwater to its termination point. 
Sequencing in a downstream direction based on 
reach-node topology gives an order to data processing 
that must be followed to route water from each 
stream segment to the next downstream segment 
throughout the network. This allows applications uti- 
lizing the network to accumulate mass referenced to 
the network in downstream order. Terminal reaches 
are defined as the last transport reach in which 
applications utilizing the networking capabilities ter- 
minate. SPARROW models compute the amount of 
flux prior to entering a fixed termination point such 
as a reservoir, international political boundary, or a 
terminal receiving body of water like an estuary or 
the ocean (Schwarz et al, 2006). 

While the stream-reach network describes the lin- 
ear connection of surface-water pathways, the catch- 
ments defined by the area drained by each individual 
reach provide the ability to spatially reference land- 
based contaminant supply, load, transport, and load- 
prediction data. The catchments are typically derived 
from DEM data at a variety of resolutions ranging 
from 30 m to 1 km cell sizes, depending on the 
geographic scale of the application (Brakebill and 
Preston, 2003; Moore et al, 2004; Alexander et al, 
2008; Hoos et al, 2008;). Catchments are important 
to SPARROW models because they provide the 
spatial foundation to geographically reference explan- 
atory parameters in manageable units and in spatial 
detail. In addition, delineated drainage catchments 
provide the ability to calculate required model para- 
meters like local catchment drainage area and accu- 
mulated total upstream watershed drainage area. 
These parameters are used to normalize explanatory 
transport factors, quantities of contaminant supply, 
and loads predicted by the models. Additional details 
on catchment delineation supporting SPARROW mod- 
eling can be found in the Supporting Information of 
this article. 

Independent of hydrologic sequencing, an addi- 
tional numbering system unique to each individual 
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modeling effort identifies each reach as a single lin- 
ear spatial unit. This unique number is also shared 
by the catchment area drained by the reach, thus 
spatially linking the hydrologically connected streams 
and the respective drainage-area characteristics. This 
provides a numerical accounting system to aggregate 
spatially referenced GIS data layers to common spa- 
tial units. In addition, other relevant physical, envi- 
ronmental, and monitoring information can be 
associated to the common network and accessed using 
the unique identification number. 

Most initial foundations of digital hydrologic net- 
works supporting aquatic transport in SPARROW 
models are based on existing vector stream-reach net- 
works such as RF1 and NHD. Enhancements to the 
datasets have been adopted in order to accommodate 
specific regional and national modeling applications. 
These enhancements include catchment generation 
for each stream reach, and topological corrections and 
attribute additions to the stream network such as 
stream monitoring, mean-annual flow estimates, and 
reservoir information (Smith et al., 1997; Alexander 
etal., 1999, 2008; Nolan et al., 2002; Brakebill and 
Preston, 2003; McMahon etal., 2003; Hoos etal., 
2008). The USGS National Water Quality Assessment 
(NAWQA) Program directed further enhancements to 
RF1 (MRB_E2RF1) in support of regional Major River 
Basin (MRB) SPARROW modeling presented in the 
SPARROW Featured Collection issue. These enhance- 
ments included the association of over 3,000 stream 
water-quality monitoring sites to stream reaches and 
updating estimated annual flows for each reach, later 
described in the application section of this article. 
Additional details related to these enhancements can 
be found in the Supporting Information of this article. 

The NHD geospatial dataset was initially chosen 
to support nutrient SPARROW models in the North- 
east region of the United States (MRB1) because 
RF1 in this part of the country contained inconsistent 
variations in stream density and lacked spatial 
detail (Moore et al., 2004). However, in order to sup- 
port regional SPARROW modeling applications, 
enhancements to NHD also were required. This 



included developing reach-node topology for stream 
routing and navigation, generating drainage catch- 
ments, and developing required model attributes like 
flow and velocity estimates and feature attributes 
such as land use. These requirements served as a cat- 
alyst for developing the NHDPlus dataset described 
earlier. Nationally, the NHDPlus dataset provides 
greater spatial detail for the locations of surface-water 
features, a larger number of stream reaches, and a 
smaller average catchment size than that of the modi- 
fied RF1 dataset (Table 1). In the near term, the 
NAWQA Program intends to adopt NHDPlus as the 
foundation for future spatial networks supporting 
regional MRB SPARROW modeling. In addition, 
future snapshots of the NHDPlus are planned. Com- 
plexity is added because NHDPlus includes three 
ingredient databases (NHD, NED, and WBD) that 
continue to evolve and change. Each of these databas- 
es is maintained separately, but by tying NHDPlus 
updates into the maintenance structure of these core 
databases, SPARROW modelers can contribute 
improvements they make to the reach network. These 
updates could then be incorporated in future versions, 
as well as become available to the larger community. 

A key network requirement for SPARROW model- 
ing is the ability to connect pathways and associated 
characteristics hydrologically (Schwarz etal., 2006). 
The spatial infrastructure of a SPARROW model 
developed partially or entirely from DEM-derived 
stream networks and flow paths across the landscape 
may provide more flexibility and spatial detail, thus 
furnishing a more precise framework to aggregate 
environmental explanatory and monitoring informa- 
tion at finer scales (Brakebill and Preston, 2003; 
Elliott et al., 2005). The flow paths provide the hydro- 
logic connection and essentially relieve the necessity 
for stream-node topology. This allows SPARROW to 
evaluate relations of sources and overland-transport 
properties to monitored streams at a grid-cell level as 
opposed to generalized or simplified catchment or 
watershed level, thereby enhancing the model's spatial 
detail. Elevation and flow-path information associated 
with the NHDPlus dataset currently provides this 



TABLE 1. Comparision of the Spatial Resolution of Mapped Features in RF1 and NHDPlus. 
This table also compares estimates of mean annual flow and catchment size. 



Number of 
Geospatial Catchments 
Dataset With Area 



Number of 
Reaches With 
Mean Annual 
Flow Estimates 



Number of Mean Catchment Mean Annual Median Flow Mean Reach 



Flowlines 



Size (W) 



Flow (m 3 /sec) (m 3 /sec *) 



Length (km) 



RF1 

NHDPlus 



66,147 
2,595,196 



63,018 
2,606,662 



64,696 
2,342,519 



129.7 
3.1 



68.9 
11.1 



2.0 
0.03 



16.0 
2.2 



Notes: Mean annual flow for geospatial dataset RF1 represents original estimates. The number of flowlines represents the number of 
nonshoreline reaches. 
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capability at a 30 m resolution (Schwarz et al., 2006; 
USEPA and USGS, 2005; Johnston et al., 2009). 

Spatial Referencing 

A digital network managed within a GIS is essen- 
tial to the SPAEROW approach. In addition to model- 
required topological properties, the network provides 
a means to geographically locate stream-reach char- 
acteristics, independent (i.e., measured) response 
variables, explanatory variables, and predicted quan- 
tities of contaminant flux. Estimates of mean-annual 
streamfiow and water velocity, reach length, mean- 
annual water travel time, reservoir surface area, and 
stream-reach type are examples of required charac- 
teristics referenced to the network and supplied 
to SPARROW models. Annual stream constituent- 
loading estimates serve as an independent response 
variable for SPARROW model calibration. Spatially 
referenced explanatory landscape, subsurface, and 
aquatic characteristics locate and identify quantities 
or surrogates of contaminant supply and the factors 
and processes associated with contaminant transport 
throughout the watershed. Geographically locating 
estimates of load (mass) on the reach network forms 
the basis to establish relations between constituent 
mass and the geographically referenced physical fac- 
tors that may affect supply, fate, and transport 
(Smith et al., 1997). 

Mean-annual streamfiow estimates are derived 
from preexisting estimates within the network data- 
sets or reestimated for stream reaches using methods 
described in the applications section of this article. 
Each stream reach also is identified as either a reser- 
voir or nonreservoir reach so that the appropriate 
aquatic loss rate can be estimated and applied to the 
modeled contaminant. Independent response vari- 
ables used to calibrate SPARROW models are typi- 
cally derived from station-specific empirical relations 
between individual water-quality concentrations, con- 
tinuous stream-discharge data, and time. At each of 
the monitoring stations, which are geographically 
located on the appropriate stream reach, data are col- 
lected at a wide range of stream sizes by several state 
and federal agencies. Each unique monitoring-station 
identification number is maintained, allowing for 
additional station-specific analysis. Once associated 
with a stream reach, the connections within the 
reach network allow for systematic climbing 
upstream from each monitoring station, thereby asso- 
ciating the downstream measured load with every 
upstream reach until the next monitoring station is 
encountered. This ability to associate the monitoring 
station with upstream reaches ensures that the 
upstream sources of contaminants are accounted for 



in the observed load measured at the downstream 
monitoring location. Nodes can be placed at monitor- 
ing locations or at shorelines of impoundments for 
more accurate associations and assessments. Placing 
nodes at each monitoring location on a reach ensures 
that load estimates used for SPARROW model cali- 
bration are referenced to the downstream end of a 
reach, thus providing detailed spatial referencing. By 
adding a node at the monitoring location, the reach 
may be split (segmented) into two separate stream 
reaches, thereby creating a new reach upstream of 
the sampling site and maintaining the reach connec- 
tivity. In some cases, this approach may be desirable 
but impractical, and the monitoring site instead is 
associated with the closest downstream node of the 
reach (Figure 2). 

The spatial arrangement and the variable charac- 
teristics identified within a hydrologic network can be 
important when evaluating spatial relations related to 
water quality and hydrology at regional scales 
(Verstraeten, 2006). Geospatial data layers 
representing these characteristics are critical to the 
evaluations using SPARROW. Geospatial data are 
merged with individual reach-catchment boundaries to 
spatially reference measures of contaminant supply 
and transport properties within a watershed. These 
quantitative values are then used by SPARROW as net- 
worked explanatory variables in the regression models. 
Contaminant sources can be viewed or evaluated in 
terms of their spatial distribution, relative supply con- 
tribution, and potential for transport to downstream 
waterways. Nutrient-source examples include point 
sources, land use, atmospheric deposition, and com- 
mercial fertilizer and manure applications (Schwarz 
et al. , 2006). Landscape and subsurface characteristics 
throughout the watershed also are merged with the 
catchments. These explanatory watershed characteris- 
tics are typically computed as mean values for each 
stream reach or normalized by catchment or watershed 
area. They represent physical properties and processes 
affecting the transport of contaminants as they move 
across the landscape and into streams. Examples 
include soil properties, slope, precipitation, and tem- 
perature (Schwarz et al., 2006). 

Geospatial data sources representing explanatory 
properties vary in detail, spatial extent, and scale. 
Data layers from local sources may be more spatially 
detailed and have more specific information content 
than sources developed at a regional or national scale. 
Land-use data, for example, exist in various media, 
temporal and physical scales, and classification 
schemes for many local communities. However, com- 
piling a land-use dataset for a regional area using 
locally derived data would be impractical because 
of these variations. Agricultural census data are 
another example of a dataset with spatial variations 
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in information content. Some relevant agricultural 
data are collected at the farm-field level but, for pri- 
vacy reasons, are distributed to the public only as 
county-level information. Simply apportioning the 
same county explanatory data at finer resolutions 
based on aerial distributions within the network does 
not necessarily improve the quality of that informa- 
tion; however, methods do exist that allocate general 
explanatory information to finer-scale land-use data 
within a county and catchment (Schwarz et al. , 2006). 

When developing a network intended to support 
SPARROW applications, one should consider the spa- 
tial extent, resolution, and the reasonable representa- 
tions and variability of watershed characteristics, 
monitoring, and constituent-source data within the 
modeled area (Schwarz et al, 2006). Information and 
detail are inherently generalized or simplified when 
explanatory data are spatially referenced to the 
hydrologic network. A network that is too coarse in 
scale relative to the explanatory information may 
negate, neglect, or oversimplify processes, sources, 
and effects on water quality. A network too detailed 
may not be able to distinguish any differences or 
detect any significant importance because of the dif- 
ferences in spatial detail between the network and 
the explanatory information. In addition, the constit- 
uent monitoring associated with the network also 
needs to be spatially detailed enough or representa- 
tive of the varying conditions to detect variability in 
the explanatory information used in the model. 

Aquatic Transport and Decay 

Streams, lakes, and reservoirs are known to trap 
sediment and contribute to nutrient losses (Chapra, 
1997; Boyer et al, 2006; Schwarz, 2008; Alexander 
et al. , 2009). Model-literature data suggest that physi- 
cal properties of aquatic systems are related to nutri- 
ent loss rates and can be generalized over broad 
spatial scales (Boyer et al, 2006). Because SPARROW 
statistically relates upstream contaminant sources to 
observed downstream loads, the explicit spatial struc- 
ture defined by the stream portion of the network per- 
mits the simulation of material loss due to aquatic 
transport processes such as sedimentation and deni- 
trification in streams, lakes, and reservoirs. These 
processes facilitate the accumulation or loss of associ- 
ated contaminants and predicted quantities from each 
reach as they move downstream (Schwarz et al. , 2006) 
and are important components of contaminant mass 
balances in watersheds. Rates of instream loss or stor- 
age of constituent mass are computed in SPARROW 
as a function of streamflow and time of travel. 
Instream loss rates are estimated statistically and, for 
nonreservoir reaches, as a first-order decay rate for 



each stream class determined by the mean-annual 
streamflow estimate for each reach. Time-of-travel 
estimates are calculated as a ratio of reach length to 
mean stream-water velocity (Smith et al, 1997; Seitz- 
inger et al, 2002; Boyer et al, 2006). However, SPAR- 
ROW models also permit stream channels to be 
considered a source of contaminant, such as in the 
case of phosphorus or suspended sediment (Schwarz, 
2008; Brakebill et al, 2010; Brown et al, this issue). 

SPARROW supports the designation of lakes and 
reservoirs spatially referenced to the network by 
locating impoundments on their corresponding 
stream reaches. Reaches are coded with an indicator, 
where 0 identifies a stream reach, 1 is an impound- 
ment reach, and 2 is an outlet reach. The TNODE of 
the outlet reach type represents the impoundment 
location, and can be related to additional impound- 
ment information such as surface area. Lakes and 
reservoirs also can be identified by determining the 
catchment in which they reside; this provides spatial 
referencing of impoundments that may be more 
detailed than the reach network. Aquatic loss or stor- 
age in reservoirs along the reach network is quanti- 
fied in SPARROW as a settling velocity in units of 
length per time. The settling velocity is calculated as 
a function of the ratio of the outflow rate of the outlet 
reach (estimated mean-annual flow) and the surface 
area of the reservoir (Schwarz et al, 2006). Lakes or 
reservoirs not on the reach network can be associated 
with the appropriate catchment and used as an over- 
land land-to-stream transport factor, computed as the 
ratio of the number of impoundments to catchment 
area (Brakebill et al, 2010). Major sources of reser- 
voir information include Ruddy and Hitt (1990) and 
the National Inventory of Dams (USACOE, 2005). 

Predictions 

SPARROW can be used to predict flux for each 
modeled reach, including unmonitored locations. 
These estimates are a function of the established lin- 
ear relations between monitored contaminant flux 
and the quantities of constituent supply referenced to 
the network (Schwarz et al, 2006). Each stream 
reach and associated catchment is treated as an inde- 
pendent unit, quantifying the amount of contaminant 
mass generated within the catchment area and trans- 
ported to the end of each stream reach. Contaminant 
mass that is generated locally for each stream reach 
is weighted by the amount of instream loss that 
would occur during aquatic transport. The cumula- 
tive loss of contaminant mass from its source through 
its continued transport downstream is dependent on 
the travel time and instream or reservoir loss rate 
of each individual reach. Mass-balance properties 
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FIGURE 3. Map Showing the Percentage of Stream Nutrients Delivered to the Gulf of Mexico From the Incremental Drainage Based 
on SPARROW Predictions: (A) Total Nitrogen; (B) Total Phosphorus (from Alexander et al., 2008, figure 3, supplemental information). 



maintained by SPARROW provide a basis for flux 
accounting, whereby predicted flux can be allocated 
to its various upstream sources both geographically 
and by source type. Specialized maps make it possible 
to visualize contaminants discharged to estuaries 
attributed to specific sources from which they drain, 
providing guidance in managing the reduction of con- 
taminant fluxes (Brakebill and Preston, 2004; Moore 
et al, 2004; Alexander et al., 2008) (Figure 3). 

Application 

The ability to describe, route, and simulate the 
transport of constituents throughout the landscape is 
a key component to hydrologic analysis and modeling 
(Maidment, 1993). Flow characteristics of hydrologic 
networks are often the driving mechanism for model- 
ing the transport of constituents such as nutrients 
and sediment. Unless flow characteristics can be 
determined reasonably well, the simulation of trans- 
port may be difficult (Moore et al., 1991). 

The RFl-based spatial framework (MRB_E2RF1) is 
well suited for enhanced attribution of features like 
annual streamflow critical to SPARROW models. The 
streamflow estimates originally associated with RF1 
reaches were computed in 1982 by W.E. Gates and 
Associates, under contract with the USEPA (Horn 
etal, 1994; USEPA, 1996). The methods are 
described in an unpublished report "Estimation of 
streamflows and the reach file." Incorrect reach topol- 
ogy and inappropriate stream connections at various 
locations within the RF1 stream-reach network have 
been corrected when identified. Associated drainage 
catchments also have been created to reflect these 
changes. However, these deficiencies in the stream- 
reach network may have affected the original stream- 
flow estimates with unexpected results. Therefore, 



the need for more accurate, current, and documented 
estimates led to the effort, described below, to develop 
a method to update streamflow estimates for each 
reach in the dataset that could subsequently be used 
in SPARROW modeling (Brown et al., this issue). 
This methodology was designed such that it also 
could be used to improve the accuracy of the current 
estimated flows in more spatially detailed datasets 
like the NHDPlus. 

In the application described here, long-term aver- 
age-annual streamflow estimates for each of the 
approximately 60,000 reaches were computed for the 
period 1971-2000. The method incorporates the cal- 
culation of runoff (flow per unit area) for hydrologic 
cataloging units on the basis of historical flow data 
collected at USGS streamgaging stations, estimation 
of flow delivered to individual stream reaches from 
their local catchments, and the use of the enhanced 
reach network to accumulate streamflow downstream 
to terminal locations. A local bias correction in the 
flow estimation then is applied by calculating the dif- 
ference between estimated and measured flow at 
streamgaging stations and then interpolating the bias 
throughout the river network. The entire process 
is repeated for each individual water year during 
the period 1971-2000. A water year is defined as 
the period from October 1 to September 30, and the 
water-year designation (e.g., 1971) corresponds to the 
year of the ending date (e.g., September 30, 1971). 

Computation of Hydrologic-Unit Runoff 

Estimates of hydrologic-unit runoff were generated 
by combining historical flow data collected at USGS 
streamgaging stations, the respective drainage-basin 
boundaries of the streamgaging stations, and the 
boundaries of the 2,110 hydrologic units. Streamgaging 
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stations for the analysis were selected for each water 
year based on the availability of a complete daily flow 
dataset for the water year. Geospatial data represent- 
ing drainage-basin divides from the location of each 
streamgaging station were delineated using the 
NHDPlus dataset and the accompanying digital- 
elevation-model-based flow-direction information 
(USEPA and USGS, 2005). Basin boundaries with a 
computed drainage area within 25% of the stream- 
gage drainage-basin area reported in the USGS 
National Water Information System (NWIS) (USGS, 
2008) were considered valid for this analysis. In a 
typical water year during the period 1971-2000, 
there were about 6,000 streamgaging stations with a 
complete daily flow dataset and an acceptable drain- 
age-basin boundary. The drainage-basin areas of 
these streamgaging stations ranged from 10 to 
180,000 km 2 with a median value of 3,000 km 2 . 

Hydrologic-unit subbasins and their associated 
eight-digit accounting numbers (HU-8s) are a widely 
used geographic framework for the conterminous 
United States. Each unit defines a geographic area 
representing part or all of a surface-drainage area or 
a combination of drainage areas. Subbasins subdivide 
larger accounting units (HU-6s), subregions (HU-4s) 
and regions (HU-2s) into smaller areas designated by 
the U.S. Water Resources Council and the USGS's 
National Water Data Network. Subbasins range in 
size from 24 to 22,808 km with a median value of 
3,133 km 2 (Seaber etal., 1987; Steeves and Nebert, 
1994). 

Figure 4 illustrates the method used to compute 
runoff estimates for HU-8s. The first step is to com- 
pute runoff values (flow per unit area) for each 
streamgage basin by dividing the average daily flow 
for the water year by the delineated basin area. In a 
hypothetical example (Figure 4), runoff is estimated 



at two streamgaging stations (labeled A and B in Fig- 
ure 4) by dividing the average daily flow measured at 
each of two streamgaging stations by their respective 
drainage-basin areas. (The drainage area of basin A 
is shaded light gray and the drainage area of basin B 
is shaded dark gray. Note that drainage basin B is 
nested within drainage basin A.) 

Each geospatial basin boundary is then overlain on 
a geospatial dataset of HU-8s (the polygons outlined 
in bold black lines) to determine the area of intersec- 
tion within the two datasets. For each overlapping 
area of HU-8s and drainage-basin boundaries, the 
fraction of the basin in the HU-8 and the fraction of 
the HU-8 in the basin are calculated. These fractions 
are then multiplied by each other to compute a 
weighting factor for each basin. The runoff values 
and associated weighting factors for all basins with 
any overlapping area with a HU-8 are combined, and 
a single weighted-average runoff value is computed 
for the HU-8 (Figure 4). 

The weighted-average runoff computations illus- 
trated in Figure 4 were repeated for all combinations 
of the roughly 6,000 basins and 2,100 hydrologic cata- 
loging units (HU-8s). Runoff values for HU-8s that 
had no overlapping areas with streamgage basins 
were computed as the mean of the HU-8 runoff val- 
ues within the same HU-4 (subregional unit). 

Intersection of HU-8 Runoff With Reach Catchments 

The amount of flow delivered to each reach from 
its uniquely identified catchment was computed by 
intersecting the HU-8 runoff grid with MRB_E2RF1 
catchments. Runoff values for each grid cell were 
summed based on the spatial intersection of the two 
datasets. The sum of grid-cell runoff values within 
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K • 



Basin 



Area (km 1 ) 100 250 

Fnctionl: Basin in HU 0.4 

Fraction2: HU in basin 1.0 

Weight (Fraction 1 x Fraction2) 0.4 

Streamflow (cfsl 14 

Runoff (mm/year) 50 



_B 
10 
1.0 
0.1 
0.1 

1.1 
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HUl runoff = [ (Weight Ax Runoff A) • (Weight B x 
Runoff B) | / (Weight A ♦ Weight B) 

= [(0.4 x 50) ♦ (0. 1 x 100)) / (0.4 + 0.1) 
= 60 mm/year 



bos in A outlet 



Basin B outlet 



FIGURE 4. Runoff Computation for a Hypothetical Hydrologic Unit. Two basins are shown as filled polygons: basin A is light gray 
and basin B, which is nested in basin A, is dark gray. Hydrologic cataloging unit boundaries (HU-8s) are indicated by bold gray lines 
and one hydrologic unit is labeled HUl. Streams and streamgage locations are shown as thin gray lines and black dots, respectively. 
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each catchment has units of runoff [L-T -1 ] multiplied 
by area [L ] and can be expressed in common units of 
flow, such as cubic feet per second (ft 3 /s), by applying 
the appropriate units conversion. The catchment flow 
values were assigned to their respective stream 
reaches based on the unique identifier shared 
between the reaches and the associated catchments. 



Accumulation of Flow Downstream Through 
the Network 

Flow delivered to the stream network from the 
catchments was accumulated in the downstream 
direction by using topological information associated 
with the MRB_E2RF1 reach network (Figure 2). The 
topology of the network defines the connections and 
flow directions of the stream segments. This allows 
any characteristic associated with the stream seg- 
ments to be accumulated throughout the river net- 
work from the most upstream reaches (headwaters) 
to the most downstream reaches (coastal or inland 
terminal segments). In the case of accumulating flow 
throughout the reach network, the flow from all 
headwater reaches (HYDSEQ = 1 and HYDSEQ = 2 
in Figure 2) is added to the flow of the next down- 
stream river segment (identified as HYDSEQ = 3 in 
Figure 2). The correct downstream segments are 
determined by matching the TNODE of the upstream 
reach to the FNODE of the downstream reach. After 
the flow from all headwater reaches has been added 
to all the neighboring downstream reaches, the pro- 
cess is repeated for successively increasing HYDSEQ 
numbers until the terminal ends of the network are 
reached (Schwarz et al, 2006). 

Removal of Local Bias in Estimated Flow 

The accuracy of the accumulated estimated flow was 
evaluated by comparing measured and estimated flow 
values at streamgaging stations. One way to visualize 
the accuracy is by tracking measured and estimated 
flow along main-stem river corridors. Two examples of 
main-stem corridors, the Hudson River and the Colo- 
rado River, are shown in Figure 5. Agreement between 
the estimated (solid line in Figure 5A) and measured 
(black squares) flows is reasonable along the Hudson 
River from its headwaters to its downstream terminal 
end at New York Bay, although the original estimates 
are higher than the streamflow measured at stream- 
gaging stations. In contrast, the measured (black 
squares) and estimated (solid line) flows diverge signif- 
icantly along the Colorado River (Figure 5B), except in 
the headwaters. The poor performance of the model 
along the Colorado River occurs because the routing 



approach assumes that flow is strictly gaining (conser- 
vative accumulation) through the river network. In 
other words, all flow delivered to the network from the 
catchments accumulates with no losses as water flows 
downstream. This assumption is reasonable for the 
Hudson River but clearly not so for the Colorado River. 
In streams that naturally "lose" flow, the water-table 
elevation adjacent to the channel often is lower than 
the water surface of the river. Water then flows down- 
ward and laterally away from the stream channel, 
resulting in a net loss of streamflow; this "lost" water 
either recharges the groundwater system or evapo- 
rates. Water also commonly is lost from a stream reach 
due to withdrawals for irrigation, public supply, and 
other human water needs (Hancock, 2002; Prudic 
et al, 2006; Rushton, 2007). 

The local bias in accumulated flows was quantified 
by calculating the difference between the estimated 
(solid lines in Figure 5) and measured (black squares) 
flows at streamgaging stations and then interpolating 
the bias throughout the stream network using the 
associated topological information (Perry et al., 2004). 
The HYDSEQ, FNODE, and TNODE values were 
used in a FORTRAN computer program to identify 
upstream and/or downstream gaging stations for 
each reach. The most similar upstream and down- 
stream gaging stations were identified for each reach 
based on drainage-basin area. 

If a streamgaging station was present upstream 
but not downstream of a stream segment, then the 
bias at ungaged segment s was computed as: 

B s = (B u /A u )xA s x(A u /A s ), (1) 

where B s is the bias at ungaged segment s, B u is the 
bias at the upstream station, A u is the drainage area 
of the upstream station, and A s is the drainage area 
of ungaged segment s. 

The bias correction method (Equation 1) is a heu- 
ristic approach in which the bias per unit area 
(B u /A u ) at the upstream gaging station is multiplied 
by the drainage area of the ungaged segment (A s ) 
and then weighted by the ratio of the drainage area 
of the upstream station (A u ) to the drainage area of 
the ungaged segment. The ratio A u /A s approaches a 
value of 1 when the drainage areas of the upstream 
gaging station and ungaged segment are very similar, 
and A u /A s approaches a value of 0 when the drainage 
area of the upstream gaging station is much less than 
that of the ungaged segment. This method reflects 
the assumption that the estimated bias measured 
at the upstream gaging station should be given sig- 
nificant weight at ungaged segments near the station, 
but the bias measured at the upstream gaging station 
should have less weight when applied to ungaged 
segments far downstream from the gaged site. 
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FIGURE 5. Estimated and Measured Daily Average Streamflow (water-year 2000) for River-Reach File 1 (RF1) Stream Reaches Along 
the Main-Stems of the (A) Hudson River and (B) Colorado River. The original flow estimate (sold line) is calculated by first intersecting the 
hydrologic unit runoff grid with MRB_E2RF1 catchments and then accumulating the flows downstream through the river network. The 
adjusted flow (dashed line) is computed by comparing the original flow estimates to flow values measured at streamgages, calculating the 
bias in the original flow estimate, and then interpolating the bias throughout the river network. 



It should be noted that the bias correction is not 
constrained to conserve mass. 

If there was a downstream gaging station but no 
upstream station, then the bias at ungaged segment s 
was computed as: 

B s = (B d /A d )xA s x(A s /A d ), (2) 

where B s is the bias at ungaged segment s, B d is the 
bias at the downstream station, A d is the drainage 
area of the downstream station, and A s is the drain- 
age area of ungaged segment s. 

Equation (2) is similar in concept to Equation (1). 
The bias per unit area (B d /A d ) at the downstream 
gaging station is multiplied by the drainage area of 
the ungaged segment (A s ) and then weighted by the 



ratio of the drainage area of the ungaged segment to 
the drainage area of the downstream station (A d ). 
The ratio A s /A d approaches a value of 1 when the 
drainage areas of the downstream gaging station and 
ungaged segment are very similar, and A s /A d 
approaches a value of 0 when the drainage area of 
the downstream gaging station is much greater than 
that of the ungaged segment. 

If both upstream and downstream gaging stations 
are present, then the bias at ungaged segment s was 
computed as: 

B s = [B u (A d -A s )+B d (A s -A u )]/(A d -A u ), (3) 

where B s is the bias at ungaged segment s, B u is 
the bias at the upstream gage, B d is the bias at 
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the downstream gage, A d is the drainage area of the 
downstream gage, A u is the drainage area of the 
upstream gage, and A s is the drainage area of 
ungaged segment s. Equation (3) interpolates the 
bias between two gages according to differences in 
drainage areas of the upstream gage, the down- 
stream gage, and the ungaged segment. For example, 
when the drainage area of the ungaged segment is 
equal to the drainage area of the upstream or down- 
stream gage, then the bias at the ungaged segment 
is equal to that of the upstream or downstream gage, 
respectively. When the ratio of the drainage area of 
the ungaged segment is exactly half that of the 
downstream gage and twice that of the upstream 
gage, then the bias measured at the upstream gage 
is given equal weight to that of the downstream 
gage. 

Addition of the interpolated bias to the accumu- 
lated estimated flows throughout the river network 
generates bias-corrected flow estimates (the dashed 
lines in Figure 5) that exactly match the measured 
values at streamgaging stations. In the river 
reaches between streamgaging stations, the bias- 
corrected flow estimates produce a smooth pattern 
consistent with the streamflow values measured at 
the streamgaging stations. The difference between 
the original and adjusted flow estimates is most 
noticeable in arid-region river corridors, such as 
along the Colorado River, where losing stream 
reaches are common due to human activities and 
natural processes. 

The entire procedure (estimation of runoff, inter- 
section with catchments, accumulation of flows in the 
reach network, and removal of local bias) was applied 
to all stream reaches for each individual water year 
from 1971 to 2000 and then the average was com- 
puted for the entire 30-year period. 

The ability to estimate mean-annual streamflow at 
ungaged reaches illustrates the benefits of using the 
river network as a component of the flow-estimation 
technique. The river network explicitly provides 
spatial connections between the ungaged reaches 
and the gaged streams. As represented by the bias- 
correction equations, knowledge of flow conditions 
upstream and downstream of ungaged reaches 
improves flow estimates compared to methods that do 
not incorporate network connectivity. These equations 
represent a type of spatial interpolation that is unique 
to river networks and cannot be approximated by sim- 
ple Euclidean spatial methods. In a network-based 
interpolation, proximity between a streamgage and an 
ungaged reach is determined by upstream and 
downstream position within the network. In addition, 
similarity between the gaged and the ungaged reach 
is a function of the ratio of drainage areas, not linear 
distance. 



SUMMARY 



Regional-scale digital hydrologic networks used 
within the United States typically have been con- 
structed from elevation points and hydrologic features 
collected by national mapping programs. The net- 
works comprise hydrologically connected stream- 
reach segments depicting surface-water pathways and 
their associated drainage catchments. They can pro- 
vide a consistent framework for descriptions and char- 
acterizations of aquatic and watershed processes 
controlling the supply, fate, and transport of constitu- 
ents. Topological properties inherent to the networks 
provide the ability to simulate the movement of water 
and associated constituents. Collectively, these net- 
work characteristics are key components to hydrologic 
analysis and modeling. One such modeling application 
is SPARROW, a hybrid statistical approach that 
establishes relations between monitored contaminant 
flux, contaminant sources, aquatic transport pro- 
cesses, and the physical characteristics affecting 
transport. Explanatory properties associated with the 
network are evaluated to assess their significant 
contribution to supply and transport relative to the 
monitored flux. 

Nutrient SPARROW models presented in this issue 
have been developed to address water-quality issues 
throughout the conterminous United States. Support- 
ing these regional models, are two separate digital 
hydrologic networks derived from the 1:500,000 River 
Reach File and 1:100,000 NHD (medium-resolution, 
enhanced to create NHDPlus). RF1 networks have been 
enhanced and modified to support various applications 
of SPARROW modeling, both regionally and nationally. 
The current national geospatial dataset contains topo- 
logical and stream characteristics necessary to execute 
a SPARROW model. These characteristics reside in a 
single table and include reach topology, hydrologic 
sequencing, reach identification, mean-annual stream- 
flow, velocity, travel time, reach length, reservoir 
surface area, and stream-reach type. Delineated drain- 
age areas for each reach provide the ability to spatially 
link watershed characteristics to the stream network. 
Because of a simplistic structure, the RFl-based net- 
work is a viable choice for a hydrologic network 
supporting applications like SPARROW. It provides an 
adequate representation of hydrologic features and 
pathways at a regional or national scale. 

The need for accurate and updated stream charac- 
teristics prompted the use of the modified RF1 spatial 
framework to generate annual streamflow for each 
reach from 1971 to 2000. These updated flows were 
subsequently used in the regional SPARROW models 
presented in this issue. The equations used for the 
flow-estimation technique represent a type of spatial 
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interpolation that is unique to river networks. 
Spatially explicit connections between ungaged and 
monitored locations provide knowledge of flow condi- 
tions upstream and downstream of ungaged reaches. 
In addition, similarities between the gaged reach and 
the ungaged reach are computed as a function of the 
ratio of associated drainage areas, not linear dis- 
tance. This methodology was constructed such that it 
also could be used to improve the accuracy of the 
current estimated flows in more spatially detailed 
datasets like the NHDPlus. 

NHDPlus is an application-ready product based on a 
2005 snapshot of three core national databases: NHD, 
NED, and WBD. Regionally, NHDPlus provides 
significant spatial detail (1:100,000 medium-resolu- 
tion) and a realistic representation of hydrologic path- 
ways and numerous spatially referenced landscape 
and climatic characteristics. NHDPlus also contains 
appropriate topological and stream characteristics nec- 
essary to execute a SPARROW model. Additionally, an 
expanded capability of NHDPlus provides 30 m 
elevation-derived catchments and flow paths useful for 
more detailed spatial referencing and network genera- 
tion. NHDPlus also benefits from a multi-agency 
coordination of NHD. Specific protocols for updating 
tools, features, and related information can potentially 
minimize any duplication of effort among stewards, 
incorporate corrections applied by SPARROW model- 
ers, and provide a consistent dataset to the greater 
community. 

NHDPlus was constructed to meet a variety of needs 
and applications. Therefore, the complexity of NHD- 
Plus resides in the components that also are dependent 
on changing and evolving databases like NHD, NED, 
and WBD. NHDPlus also contains a vast number of 
records in which relations between surface-water flow 
paths and other associated characteristics are not as 
simplistic as in RF1. Many external tables contain 
numerous attributes related by common fields to the 
spatial locations of hydrologic features. Standardized 
tools or scripts could automate and simplify procedures 
for establishing these relations and extracting the data 
for a specific use in models like SPARROW. 

As with many stream networks, the source infor- 
mation for mapping the hydrologic features of NHD- 
Plus were based on cartographic interpretations. 
Mapping anomalies such as varying stream densities 
and isolated or unconnected reaches currently are 
being identified and corrected. One approach to 
improving inconsistencies in drainage density may be 
to remove streams in the smallest catchments. 

Hydrologic networks developed from elevation data 
may provide more flexibility for transport models like 
SPARROW. Elevation-based flow-path networks allow 
for the option of processing environmental information 
at a grid-cell level as opposed to a catchment, reach, or 



watershed level. This may provide a more precise way 
to aggregate source and transport characteristics and 
take advantage of additional monitoring and environ- 
mental explanatory information at finer scales. Like 
NHDPlus, the flow-path networks can be improved 
using drainage-enforcement techniques and the exist- 
ing vector-based hydrography. NHDPlus also provides 
the necessary features that can be used to construct 
and evaluate a cell-based network supporting SPAR- 
ROW modeling. In addition, detailed elevation data 
are becoming more readily available for larger areas. 
It also is feasible to assemble networks at various 
adjustable scales based on streamflows estimated by 
the techniques described in this article. 



SUPPORTING INFORMATION 



Additional supporting information can be found in 
the online version of this article and is intended to sup- 
ply the reader with additional information on the devel- 
opment of hydrologic networks supporting regional 
nutrient SPARROW models and the NAWQA Program. 

Figure SI. Schematic outline of the processes used 
to generate catchments for each MRB_E2RF1 reach. 

Please note: Neither AWRA nor Wiley-Blackwell 
is responsible for the content or functionality of 
any supporting materials supplied by the authors. 
Any queries (other than missing material) should be 
directed to the corresponding author for the article. 
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